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Abstract 

The human braingraph, or connectome is a description of the connections of 
the brain: the nodes of the graph correspond to small areas of the gray mat¬ 
ter, and two nodes are connected by an edge if a diffusion MRI-based workflow 
finds fibers between those brain areas. We have constructed 1015-vertex graphs 
from the diffusion MRI brain images of 392 human subjects and compared the 
individual graphs with respect to several different areas of the brain. The inter¬ 
individual variability of the graphs within different brain regions was discovered 
and described. We have found that the frontal and the limbic lobes are more 
conservative, while the edges in the temporal and occipital lobes are more di¬ 
verse. Interestingly, a “hybrid” conservative and diverse distribution was found 
in the paracentral lobule and the fusiform gyrus. Smaller cortical areas were 
also evaluated: precentral gyri were found to be more conservative, and the 
postcentral and the superior temporal gyri to be very diverse. 


1. Introduction 

Large co-operative research projects, such as the Human Connectome 
Project [1], produce high-quality MRI-imaging data of hundreds of healthy in¬ 
dividuals. The comparison of the connections of the brains of the subjects is a 
challenging problem that may open numerous research directions. In the present 
work we map the variability of the connections within different brain areas in 
392 human subjects, in order to discover brain areas with higher variability in 
their connections or other brain regions with more conservative connections. 

The braingraphs or connectomes are the well-structured discretizations of 
the diffusion MRI imaging data that yield new possibilities for the compari¬ 
son of the connections between distinct brain areas in different subjects [2, 3] 
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or for finding common connections in distinct cerebra [4], forming a common, 
consensus human braingraph. 

Here, by using the data of the Human Connectome Project [1], we describe, 
by their distribution functions, the inter-individual diversity of the braingraph 
connections in separate brain areas in 392 healthy subjects of ages between 22 
and 35 years. 

Since every brain is unique, the workflow that produces the braingraphs 
consists of several steps, including a diffeomorphism [5] of the brain atlas to 
the brain-image processed. After the diffeomorphism, corresponding areas of 
different human brains are pairwise identified through the atlas and, conse¬ 
quently, can be compared with one another. The braingraphs, with nodes in 
the corresponded brain areas, are prepared from the diffusion MRI images of the 
individual cerebra through a workflow detailed in the “Methods” section. Every 
braingraph studied contains 1015 nodes (or vertices). The vertices correspond 
to the subdivision of anatomical gray matter areas in cortical and subcortical 
regions. For the list of the regions and the number of nodes in each region, we 
refer to Table SI and Figure SI in the Appendix. 

Next, we describe the variability, or the distribution of the graph edges in 
each brain region, and also in each lobe. Figure 1 contains a simplified example 
on three small graphs (1,2,3) each with only two regions (A & B). The example 
clarifies the method, the way the results are presented through a distribution 
function, and the diagrams describing these functions. 

For any fixed brain area, and for any a: : 0 < a; < 1, let F(x) denote 
the fraction of the edges^ in the fixed area^ that are present in at most the 
fraction x of all braingraphs, (for a more exact definition of F{x) we refer to the 
“Methods” section). We note that F{x) is a cumulative distribution function 
[6] of a random variable described in the “Methods” section. 

2. Results and Discussion 

Table 1 summarizes the edge diversity results for the 392 graphs for the lobes 
of the brain, described by the distribution functions F{x). The last column 
contains the data for the whole brain with 1015 nodes and 70,652 edges. The 
sum of the edges of the lobes in Table 1 is 30,326: these edges have both 
endpoints in the same lobe. More than forty thousand edges are present and 
accounted for only in the last column, because these edges connect nodes from 
different lobes. Therefore, the values in the last column cannot be derived from 
the other columns, since that column contains the contribution of edges that do 
not contribute to any other columns. 

We want to find out which brain areas are more conservative and which are 
more diverse than the others. We suggest to designate an area as “conservative” 


^i.e., the number of the edges in question, divided by the number of all edges in the fixed 
area; 

^i.e., with both vertices in the fixed area; 
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X, the fractioo of graphs 


Figure 1: A simple example of computing the edge distribution between brain areas. In the 
example, there are three “braingraphs”, each with two areas: A and B. We intend to count 
the edges that are present in all three graphs, only in two graphs and only in a single graph, 
respectively (between the same nodes, but in different graphs). For example, the copies of 
edge e are present in all three A areas, copies of edge h in all three B areas, copies of edge 
g in two B areas and edge / is present only in SI. The edges crossing the boundary of A 
and B (colored green) are ignored when counting the edge distribution within the areas A 
and B. In area A, two edges are present once, two edges twice and also two edges (including 
edge e) exactly three times. In area B, two edges (including /) are present once, four edges 
(including g) twice and one edge - h — three times. In the diagram on the bottom, we give 
the F{x) distribution functions for both areas. On axis x, the fractions of the graphs are 
given, 1/3 correspond to one graph, 2/3 for two and 1.0 for all three graphs. F{x) is defined 
as the fraction of the edges in the fixed area that are present in at most the fraction x of all 
braingraphs. Data points corresponding to area A are on the same blue line (1/3, 2/3, 1) and 
those, corresponding to area B are on the broken, red line (2/7, 6/7, 1). We remark that if all 
three graphs are the same, then the data points are (0,0,1) (the extremely conservative case, 
orange line). Similarly, if no two graphs have the same edges, the data points are (1,1,1) (that 
is the extremely diverse case, green line). This type of diagram is used for the presentation of 
the results of the distribution of the edges in separate areas of the brain: The faster the line 
reaches the top F{x) = 1 value, the more diverse is the edge set in the corresponding brain 
area. We also note that in the diagram the lines connect the data points corresponding to the 
discrete values on axis x, and do not describe the step-function F{x) between the data points: 
we have chosen this visualization method because of its clarity even if a higher number of 
areas are shown (c.f. Figures 2 and 3 with numerous crossing lines). 


if for most x values, its F{x) distribution function is less than the F{x) of the 
all brain, given in the last column. We also suggest to designate an area as 
“diverse” if for most x values, its F{x) distribution function is greater than the 
F(x) of the all brain, given in the last column. 

The most conservative lobes are the smallest ones: the brainstem, the tha- 
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lamus and the basal ganglia contain only 1, 2 and 8 nodes, resp., and most of 
the edges in those regions are present in almost all braingraphs. If we take the 
average number of the braingraphs containing an edge from those regions, we 
get 316, 390 and 213 graphs, resp. 

It is much more interesting to review the diversity of the connections in 
larger areas. The frontal and the limbic lobes are conservative for most values 
of X (i.e., their F{x) values are less than that of the last column), while the 
temporal and the occipital lobes are diverse for larger x’s. The distribution of 
the edges in the fusiform gyrus is particularly interesting: more than 10% of the 
graphs contain 46% of the edges which means this is a conservative brain area 
in that parameter domain, compared to the other lobes. The fusiform gyrus 
remains conservative for x = 0.2 and even for x = 0.3, but more than 50% 
of the graphs contain only 0.7% of the edges. That means that some edges of 
the fusiform gyrus are well conserved, and some parts are very diverse. The 
paracentral lobule has a very similar distribution. 
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Table 1: The number of nodes, the number of edges and the diversity of the edges in different 
lobes, measured by the distribution function F{x). The list includes some brain areas that 
usually are not counted as lobes: like the fusiform gyrus, basal ganglia, and the paracentral 
lobule. The lobes, whose columns reach the value 1 faster (i.e. have more I’s at the bottom) 
have higher diversity. For example, the frontal and the limbic lobes are more conservative, 
while the temporal and the occipital lobes are more diverse. The distribution of the edges in 
the fusiform gyrus is particularly interesting: more than 10% of the graphs contain 46% of 
the edges which means this is a conservative brain area in that parameter domain, compared 
to the other lobes. The fusiform gyrus remains conservative for x = 0.2 and even for x = 0.3, 
but more than 50% of the graphs contain only 0.7% of the edges. Therefore, some edges of 
the fusiform gyrus are well conserved, and some other parts are very diverse. The paracentral 
lobule has a similar distribution. The data are also visualized on Figure 2 and an interactive 
figure http: //uratim. coin/diversity/Figure_2.html 


Table 2 summarizes the diversity results for those cortical areas which have 
more than 222 edges (see Table S2 in the Appendix for the edge numbers). 
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Figure 2: The diversity of the edges in different lobes, measured by the distribution function 
F{x). Only the areas with more than 10 nodes and F{x) values of more than 0.8 are visualized. 
The lobes, whose lines faster (i.e., with smaller x) reach value 1, have higher diversity. The 
fusiform gyrus and the paracentral lobule clearly moves from the bottom to the top of the 
diagram, relative to the other lines: this observation suggests that some of their edges are 
very conservative, and other areas have high diversity. An interactive version of this figure 
can be found at http://uratiin.com/diversity/Figure_2.htiiil 


3. Methods 

We have worked with a subset of the anonymized 500 Subjects Release pub¬ 
lished by the Human Connectome Project [1]: (http: //www.humcuiconnectome. 
org/documentation/S500) of healthy subjects between 22 and 35 years of age. 
Data were downloaded in October, 2014. 

We have applied the Connectome Mapper Toolkit [7] (http://cmtk.org) 
for brain tissue segmentation, partitioning, tractography and the construction 
of the graphs. The fibers were identified in the tractography step. The pro¬ 
gram FreeSurfer was used to partition the images into 1015 cortical and sub¬ 
cortical structures (Regions of Interest, abbreviated: ROls), and was based on 
the Desikan-Killiany anatomical atlas [7] (see Figure 4 in [7]). Tractography was 
performed by the Connectome Mapper Toolkit [7], using the MRtrix process¬ 
ing tool [8] and choosing the deterministic streamline method with randomized 
seeding. 

The graphs were constructed as follows: the 1015 nodes correspond to the 
1015 ROIs, and two nodes were connected by an edge if there exists at least one 
fiber connecting the ROls corresponding to the nodes. 
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48.5 
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Table 2: The diversity of the edges in different cortical areas, measured by the distribution 
function F(x). The abbreviation “ctx-lh” stands for “cortex left-hemisphere”, “ctx-rh” for 
“cortex right-hemisphere”. The areas, whose columns reach the value 1 faster (i.e., have more 
I’s at the bottom) have higher diversity. As in Table 1, the frontal regions are relatively 
more conservative, while the parietal regions are more diverse. Both precentral gyri are also 
conservative, and the postcentral and the superiortemporal gyri are more diverse. The last 
row contains the expected number of the graphs which contain a randomly chosen edge from 
the brain area indicated. Large expected number implies a conservative area, a small value 
implies a more diverse area. The data for the left hemisphere are also visualized on Figure 3 
and on an interactive figure http://uratiin.coin/diversity/Figure_3.html 
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Figure 3: The diversity of the edges in different cortical areas of the left hemisphere, measured 
by the distribution function F{x). The areas, whose lines faster (i.e., with smaller x) reach 
value 1, have higher diversity. An interactive version of this figure can be found at http: 
//uratim.com/diversity/Figure_3.html 
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3.1. The distribution function 

The variability of the edges in regions or lobes are described by cumulative 
distribution functions (CDF) (also called just the “distribution function”) of the 
edges [6]. The general definition of the CDF is as follows: 

Definition 1. Let Y be a real-valued random variable. Then 

F{x) = Pr{Y < x) 

defines the cumulative distribution function of Y for real x values. 

For example, if a is the maximum value of Y then F(a) = 1, and if b is less 
than the minimum value of Y, then F{b) = 0. 

CDFs are used the following way: Suppose that our cohort consists of n 
persons’ braingraphs (in the present work n = 392). For a given, fixed brain 
area, our random variable Y takes on values Y = u/n,u = 0,1,...,n. The 
equation Y = ujn corresponds to the event that a uniformly, randomly chosen 
edge is in exactly u graphs from the n possible one, and the probability Pr(Y = 
u/n) gives the probability of this event. Or, in other words, the equation Y = 
u/n corresponds to the set of edges — with both nodes in the fixed brain area 
— which are present in exactly u braingraphs, and the probability Pr{Y = 
u/n) gives the fraction of the edges that are present in exactly u braingraphs. 
Therefore, F{x) = Pr{Y < x) gives the fraction (i.e., the probability) of the 
edges that are present in at most of a fraction x of all the graphs. 

The number of nodes and edges in each brain regions are given in supporting 
Tables SI and S2 in the Appendix. We remark that we counted the edges 
without multiplicities: that is, if an edge e was either present in, say, 42 copies 
or just 1 copy of the braingraph, in both cases we counted it only once. 

The distributions were computed by counting the number of appearances of 
each edge in all the 392 braingraphs. Then the distribution of these numbers 
were evaluated in lobes and smaller cortical areas. 

4. Conclusions: 

By our knowledge for the first time, we have mapped the inter-individual 
variability of the braingraph edges in different cortical areas. We have found 
more and less conservative areas of the brain: for example, frontal lobes are 
conservative, superiortemporal and the post-central gyri are very diverse. The 
fusiform gyrus and the paracentral lobule have shown both conservative and 
diverse distributions, depending on the range of the parameters. 


Data availability: 


The unprocessed and pre-processed MRI data are available at the Human 
Connectome Project’s website: 

http://www.humanconnectome.org/documentation/S500 [1]. 
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The assembled graphs that were analyzed in the present work can be accessed 
and downloaded at the site 

http://braingraph.org/download-pit-group-connectomes/. 
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Appendix 

Abbreviations: ctx-rh: cortex right-hemisphere ctx-lh: cortex left- 

hemisphere 


Area name 

No. Of nodes 



ctx-lh-superiorfrontal 

45 

ctx-rh-superiorfrontal 

42 

ctx-rh-precentral 

36 

ctx-lh-precentral 

35 

ctx-lh-postcentral 

31 

ctx-rh-postcentral 

30 

ctx-lh-superiorparietal 

29 

ctx-rh-superiorparietal 

29 

ctx-rh-rostralmiddlefrontal 

27 

ctx-lh-superiortemporal 

26 

ctx-lh-rostralmiddlefrontal 

26 

ctx-rh-inferiorparietal 

26 

ctx-rh-superiortemporal 

25 

ctx-rh-lateraloccipital 

23 

ctx-rh-precuneus 

23 

ctx-lh-lateraloccipital 

23 

ctx-lh-precuneus 

22 

ctx-lh-inferiorparietal 

22 

ctx-lh-supramarginal 

21 

ctx-rh-supramarginal 

20 

ctx-rh-middletemporal 

19 

ctx-lh-fusiform 

18 

ctx-rh-lateralorbitofrontal 

17 

ctx-rh-fusiform 

17 

ctx-rh-lingual 

17 

ctx-lh-insula 

17 

ctx-lh-lingual 

17 

ctx-lh-inferiortemporal 

16 

ctx-rh-insula 

16 

ctx-rh-inferiortemporal 

16 

ctx-lh-middletemporal 

16 

ctx-lh-lateralorbitofrontal 

16 
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ctx-lh-caudalmiddlefrontal 

13 

ctx-rh-paracentral 

12 

ctx-lh-paracentral 

11 

ctx-rh-caudalmiddlefrontal 

11 

ctx-rh-medialorbitofrontal 

11 

ctx-lh-medialorbitofrontal 

10 

ctx-lh-parsopercularis 

10 

ctx-lh-posteriorcingulate 

9 

ctx-rh-posteriorcingulate 

9 

ctx-rh-parsopercularis 

9 

ctx-rh-parstriangularis 

8 

ctx-rh-cuneus 

8 

ctx-rh-pericalcarine 

8 

ctx-lh-cuneus 

7 

ctx-lh-pericalcarine 

7 

ctx-lh-isthmuscingulate 

7 

ctx-lh-parstriangularis 

7 

ctx-rh-parahippocampal 

6 

ctx-lh-bankssts 

6 

ctx-rh-caudalanteriorcingulate 

6 

ctx-rh-isthmuscingulate 

6 

ctx-lh-parahippocampal 

6 

ctx-rh-bankssts 

6 

ctx-lh-rostralanteriorcingulate 

5 

ctx-lh-caudalanteriorcingulate 

5 

ctx-rh-parsorbitalis 

4 

ctx-lh-transversetemporal 

4 

ctx-lh-parsorbitalis 

4 

ctx-rh-rostralanteriorcingulate 

4 

ctx-lh-entorhinal 

3 

ctx-lh-temporalpole 

3 

ctx-rh-temporalpole 

3 

ctx-rh-transversetemporal 

3 

ctx-lh-frontalpole 

2 

ctx-rh-entorhinal 

2 

ctx-rh-frontalpole 

2 

Left-Thalamus-Proper 

1 

Left-Amygdala 

1 

Right-Hippocampus 

1 

Right-Amygdala 

1 

Right-Putamen 

1 

Right-Accumbens-area 

1 

Left-Hippocampus 

1 

Left-Pallidum 

1 

Right-Pallidum 

1 

Right-Thalamus-Proper 

1 

Left-Putamen 

1 
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Right-Caudate 

1 

Left-Caudate 

1 

Left-Accumbens-area 

1 

Brain-Stem 

1 



Sum of nodes 

1015 


Table SI: The number of nodes in each ROI. 



basalgangita 0.8% | 
thalamus 0.2% | 
Srain-Stem 0.1% I 


Figure SI: The number of nodes in ROIs and lobes. The interactive figure can be viewed at 
http: //uratim. coin/diversity/Figure_Sl-Krona.html 


’all’ 

70652 

’ctx-lh-superiorfrontal’ 

910 

’ctx-rh-superiorfrontal’ 

774 

’ctx-rh-precentral’ 

500 

’ctx-lh-precentral’ 

448 

’ctx-rh-rostralmiddlefrontal’ 

352 

’ctx-rh-inferiorparietal’ 

340 

’ctx-lh-rostralmiddlefrontal’ 

331 
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’ctx-lh-superiorparietal’ 

317 

’ctx-rh-superiorparietal’ 

314 

’ctx-lh-postcentrar 

305 

’ctx-rh-postcentral’ 

273 

’ctx-rh-lateraloccipital’ 

263 

’ctx-lh-lateraloccipital’ 

254 

’ctx-lh-superiortemporar 

250 

’ctx-lh-inferiorparietal’ 

242 

’ctx-rh-superiortemporal’ 

228 

’ctx-rh-precuneus’ 

227 

’ctx-lh-precuneus’ 

222 

’ctx-lh-supramarginal’ 

209 

’ctx-rh-supramarginar 

206 

’ctx-rh-middletemporar 

176 

’ctx-lh-fusiform’ 

157 

’ctx-rh-lateralorbitofrontar 

144 

’ctx-lh-inferiortemporal’ 

135 

’ctx-rh-insula’ 

131 

’ctx-lh-lingual’ 

131 

’ctx-rh-fusiform’ 

130 

’ctx-rh-inferiortemporal’ 

130 

’ctx-lh-lateralorbitofrontal’ 

127 

’ctx-lh-insula’ 

125 

’ctx-lh-middletemporal’ 

119 

’ctx-rh-lingual’ 

114 

’ctx-lh-caudalmiddlefrontal’ 

91 

’ctx-rh-paracentral’ 

76 

’ctx-rh-caudalmiddlefrontal’ 

65 

’ctx-lh-paracentral’ 

64 

’ctx-rh-medialorbitofrontal’ 

59 

’ctx-lh-parsopercularis’ 

55 

’ctx-lh-medialorbitofrontal’ 

54 

’ctx-lh-posteriorcingulate’ 

45 

’ctx-rh-parsopercularis’ 

45 

’ctx-rh-posteriorcingulate’ 

43 

’ctx-rh-parstriangularis ’ 

36 

’ctx-rh-cuneus’ 

35 

’ctx-rh-pericalcarine’ 

35 

’ctx-lh-cuneus’ 

28 

’ctx-lh-pericalcarine’ 

28 

’ctx-lh-isthmuscingulate’ 

28 

’ctx-lh-parstriangularis’ 

28 

’ctx-lh-bankssts’ 

21 

’ctx-rh-caudalanteriorcingulate’ 

21 

’ctx-lh-parahippocampal’ 

21 

’ctx-rh-parahippocampal’ 

20 

’ctx-rh-isthmuscingulate ’ 

20 
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’ctx-rh-bankssts’ 

20 

’ctx-lh-rostralanteriorcingulate’ 

15 

’ctx-lh-candalanteriorcingnlate’ 

15 

’ctx-rh-parsorbitalis’ 

10 

’ctx-lh-parsorbitalis’ 

10 

’ctx-rh-rostralanteriorcingulate’ 

10 

’ctx-lh-transversetemporal’ 

8 

’ctx-lh-entorhinal’ 

6 

’ctx-rh-transversetemporal’ 

5 

’ctx-lh-temporalpole’ 

4 

’ctx-rh-entorhinal’ 

3 

’ctx-rh-temporalpole’ 

3 

’Left-Thalamus-Proper’ 

1 

’Left-Amygdala’ 

1 

’ctx-lh-frontalpole’ 

1 

’Right-Hippocampus’ 

1 

’Right-Amygdala’ 

1 

’ctx-rh-frontalpole’ 

1 

’Right-Putamen’ 

1 

’Right-Accumbens-area’ 

1 

’Left-Hippocampus’ 

1 

’Left-Pallidum’ 

1 

’ Right- Pallidum ’ 

1 

’ Right- T halamus-Proper ’ 

1 

’Left-Putamen’ 

1 

’Right-Caudate’ 

1 

’Left-Caudate’ 

1 

’ Left-Accumbens-area’ 

1 

’Brainstem’ 

1 


Table S2: The number of edges in each ROI. 
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